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Abstract 

The classical dynamic programming-based optimal stochastic control methods fail to cope 
with nonseparable dynamic optimization problems as the principle of optiniality no longer 
^ , applies in such situations. Among these notorious nonseparable problems, the dynamic 

' mean-variance portfolio selection formulation had posted a great challenge to our research 

community until recently. A few solution methods, including the embedding scheme, have 
been developed in the last decade to solve the dynamic mean-variance portfolio selection 
formulation successfully. We propose in this paper a novel mean-field framework that offers 
a more efficient modeling tool and a more accurate solution scheme in tackling directly the 
, issue of nonseparability and deriving the optimal policies analytically for the multi-period 

\^ • mean-variance-type portfolio selection problems. 
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1 Introduction 

. The mean-field type of optimal stochastic control models deals with problems in which both the 

system dynamics and objective functional could involve the states as well as the expected values 
of the states. The past few years have witnessed an increasing number of successful applications 
of the mean-field formulation, including mean-field type of stochastic control problems, in various 
fields of science, engineering, financial management, and economics. Although the research in 
this direction has been well developed for continuous-time control problems, it lacks progress 
in both theoretical investigation and applications in discrete-time problems. The current work 
in this paper aims to employ the mean-field formulation to cope with seemingly non-tractable 
nonseparability in discrete-time portfolio selection problems. In particular, we revisit three 
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challenging, yet practically important, portfolio selection models over a finite-time investment 
horizon (see Li and Ng [16j . Costa and Nabholz [lOj, Zhu et al. [28j), reformulate them as 
discrete-time linear-quadratic control problems of a mean-field type, and derive their optimal 
strategies with improved solution qualities. 

Since Markowitz [18] published his seminal work on the mean-variance portfolio selection 
sixty years ago, the mean-risk portfolio selection framework has become one of the most sig- 
nificant ingredients in the modern financial theory. An important yet essential research theme 
under the mean-risk portfolio selection framework is to strike a balance between achieving a high 
mean of the investment return and minimizing the corresponding risk. If we adopt the variance 
of the terminal wealth as a risk measure for investment, we have the following mathematical 
formulations of the classical static mean- variance models, 

iMV{a)) max E(xi), 

s.t. Var(xi) < a^, 
xi = xo + uo • So, 

and 

{MV{e)) min Var(xi), 
s.t. E(xi) > e, 

Xi = Xo + Uo • So, 

which are equivalent to 

(MVs) max £(2:1) - a;Var(xi), 
s.t. Xi = Xo + Uo » So, 

where xt is the wealth at time t, ut is the portfolio strategy at time t, St is the random return at 
time t, xo+uo*So denotes the random terminal wealth xi from applying strategy uo in the market 
with initial wealth xq, and w > denotes the trade-off between the two conflicting objectives of 
maximizing expected return and minimizing the risk. The optimal portfolio strategy and solution 
scheme of {MVs) can be found in Merton [20] when shorting is allowed and in Markowitz |18] 
when shorting is prohibited. 

However, the extension to a dynamic version of mean-variance portfolio selection was blocked 
for four decades until recently. Let us consider the following abstract form for the dynamic 
mean-variance portfolio selection problem, 

{MV{oj)) max E(xr) - a;Var(xT), 

u 

s.t. XT = Xo + {ut • St} \^^o ^ 

where xo+{ut»St} 1^1^^ denotes the random terminal wealth xt from applying strategy {ut} 

in the market with initial wealth xq- Due to the non-smoothing property of the variance term, 

i.e., 

Var(Var(-|J-i)| J-,) Var(-|7-,), Vi < j, 

where J-j is the information set available at time j and J'j^i C Tj, {MV{oj)) is not a standard 
stochastic control problem whose objective functional involves the wealth state as well as a 
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nonlinear function of the expected wealth and, thus, does not satisfy the principle of optimality. 
Therefore, all the traditional dynamic programming-based optimal stochastic control solution 
methods no longer apply. 

We now briefly summarize the main approaches in the current literature to overcome the 
difficulty resulted from the nonseparability. Adopting an embedding scheme, Li and Ng [TB] and 
Zhou and Li ^27J considered the following family of auxiliary problems, A{uj, A), parameterized 
in parameter A, 

A{uj,X) min E(a;x|' — Axt), 

u 

S.t. XT = Xo + {Ut • St} iJ^Q . 

Note that problem A{uj, A) is a separable linear-quadratic stochastic control (LQSC) formulation 
and can be thus solved analytically. Li and Ng [16] and Zhou and Li [27] derived the optimal 
policy to the primal nonseparable problem {MV{u})) via identifying the optimal parameter 
A and applying the optimal A* to A{uj,X). The embedding scheme has been also extended 
to multi-period mean-variance model with intertemporal restrictions (see Costa and Nabholz 
|10j). multi-period mean- variance model in a stochastic market whose evolution is governed by 
a Markovian chain (see (^elikyurt and Ozekici [5]), a generalized mean- variance model with 
risk control over bankruptcy (see Zhu et al. [2S]), and dynamic mean- variance asset- liability 
management (see Leippold et al. [15], Chiu and Li [9], Chen and Yang [8]). 

By introducing an auxiliary variable d and an equality constraint E{xt) = d for the expected 
terminal wealth, Li et al. [17] paved the road to study the following slightly modified, albeit 
equivalent, version of {MV{u})) (we omit the no-shorting constraint here and focus on the model 
itself), 

{MV{d)) min Var(xT) = E(xt - d)^ 

u 

s.t. E(xr) = d, 

XT = xo + {ut • St} \ . 

Introducing a Lagrangian multiplier A and applying Lagrangian relaxation to {MV{d)) gives 
rise to the following LQSC problem, 

(L(A)) min E{xt - df - XE{xt - d) , (1) 
s.t. XT = xo + {ut • St} IJ'q^ . 

The optimal policy of {MV{d)) can be obtained by maximizing the dual function L(X) over all 
Lagrangian multiplier A G M. In fact, the Lagrangian problem {L{X)) can be further written as 
the following LQSC problem, 

{MVH{m)) min E(xt - m)^ (2) 
s.t. XT = xq + {ut • St} , 

where m = d + A/2. Problem {MVH{m)) is a special mean-variance hedging problem, in 
which an investor hedges the target m by his/her portfolio under a quadratic objective function. 
Problem {MVH{m)) has been well studied and can be solved by LQSC theory (see Li et al. 
[TT]). martingale/convex duality theory (see Schweizer [23], Xia and Yan [25]) and sequential 
regression method (see Cerny and Kellsen [6]). 
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In all the literature mentioned above, a static optimization procedure is always necessary 
to identify an optimal parameter in the parameterized auxiliary problem ^(a;,A), (-^^(A)) or 
{MVH{m)). Actually, based on the pure geometric structure of {MV{u})), Sun and Wang [21] 
proved that the optimal terminal wealth takes the following form, 

Xt = xo + r / c 1 iT-iM t • \t=0' 

2ujE{l-{^fSt} \t=o ) 

where 93* is the policy of the following particular mean-variance hedging problem, 

(MVH{1)) min E{xt - l)^ 

S.t. XT = {(ft • St} \ JJq^ . 



All the above approaches attempt to embed the "nontractable" nonseparable mean-variance 
portfolio selection problem into a family of tractable LQSC problems. Although these transfor- 
mations seem necessary, one meaningful yet challenging question emerges naturally: Are we able 
to directly tackle the above nonseparable dynamic mean-variance problems (without introducing 
an auxiliary problem)? 

The mean-variance problem is in fact a special case of the mean-field type problems where 
both the underlying dynamic system and the objective functional involve state processes as well 
as their expected values (hence the name mean- field). This critical feature differentiates the 
mean- variance problem from standard stochastic control problems. The theory of the mean-field 
stochastic differential equation can be traced back to Kac [H] who presented the McKean-Vlasov 
stochastic differential equation motivated by a stochastic toy model for the Vlasov kinetic equa- 
tion of plasma. Since then, the research on related topics and their applications has become a 
notable and serious endeavor among researchers in applied probability and optimal stochastic 
controls, particularly in financial engineering. This new direction, however, requires new ana- 
lytical tools and solution techniques. For instance, in a recent research on mean-field forward 
stochastic LQ optimal control problems, Yong p6] introduced a system of two Riccati equations 
to solve the problem. Representative works in mean-field include, but not limited to, Mckean 
|19j . Dawson [12], Chan [7], Buckdahn et al. [1], Borkar and Kumar [2], Crisan and Xiong 
Andersson and Djehiche [1], Buckdahn et al. [3], Meyer-Brandis et al. [21], Nourian et al. [22] 
and Yong [26]. Despite active research efforts on mean- field in recent years, the topic of multi- 
period models in discrete-time remains a relatively unexplored subject where the mean-field 
modeling scheme has not yet been applied. 

In this paper, we will develop a unified framework of mean-field formulations to investi- 
gate three multi-period mean-variance models in the literature: classical multi-period mean- 
variance model in Li and Ng [16], multi-period mean- variance model with intertemporal re- 
strictions in Costa and Nabholz [10], and a generalized mean- variance model with risk control 
over bankruptcy in Zhu et al. [28j . We demonstrate that the mean-field approach represents 
a new promising way in dealing with nonseparable stochastic control problems related to the 
mean-variance formulations and even improves solution quality of some existing results in the 
literature. 
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2 Mean-Field Formulations for Multi- Period Mean- Variance Port- 
folio Selection 



We consider in this paper a capital market consisting of one riskless asset and n risky assets 
within a time horizon T. Let St (> 1) be a given deterministic return of the riskless asset at 
period t and et = [ej,--- ,ef]' the vector of random returns of the n risky assets at period 
t. We assume that vectors et, t = 0, 1, ■ ■ ■ , T — 1, are statistically independent and the 
only information known about the random return vector is its first two moments, its mean 
E(et) = [E{el),'E{ef), ■ ■ ■ ,E(ef )]' and its positive definite covariance 



Cov(et)=E(ete;)-E(et)E(e;) = 
From the above assumptions, we have 



<7t,ll • • ■ 

y 0. 



CtAn 



y 0. 



We further define the excess return vector of risky assets as 

P, = [Pt',P^, • • • , Pn' = [{el - St), {el - • • • , (e? - st)]' . 
The following is then true for t = 0, 1, . . . , T — 1: 



si stE{V't) 
stE(Pt) E(PtPi) 



1 0' 
-1 / 



si stE(et') 
stE(et) E(etet') 



1 0' 
-1 / 



^0, 



where 1 and are the n-dimensional all-one and all-zero vectors, respectively, and / is the n x n 
identity matrix, which further implies 

E(PtP;)^o, Vi = o,i,--- ,r-i, 

sl{l-Bt)>Q, Vi = 0,l,--- ,T-1, 

where Bt = E(P;)E-i(PtPj)E(Pt). 

An investor joins the market at the beginning of period with an initial wealth xq. He/she 
allocates xq among the riskless asset and n risky assets at the beginning of period and re- 
allocates his/her wealth at the beginning of each of the following (T — 1) consecutive periods. 
Let xt be the wealth of the investor at the beginning of period t, and ul, i = 1,2, ■■ ■ ,n, be 
the amount invested in the i-th risky asset at period t. Then, xt — ^i^l be the amount 

invested in the riskless asset at period t. The information set at the beginning of period t is 
denoted as 

Ji = (7(JbV(7(Po,Pl,--- ,Pt-l)), 

where To contains Xq, st and the first and second moment information of Pt, t = 0,1, - ■ ■ ,T — 1. 
We confine an admissible investment strategics to be J-t-measurable Markov control, i.e., € J^f 
Then, Pj and Ut are independent, {xt} is an adapted Markovian process and J^t = o'{J^o\/ a{xt)). 
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The conventional multi-period mean-variance model is to seek the best strategy, = 
[(n|)*, (Uf)*, • • • , (u")*]', t = 0, 1, • • • ,T — 1, which is the optimizer of the following stochas- 
tic discrete-time optimal control problem, 

(MMV) max E(j;t) - WTVar(xT), (3) 

n / ™ \ 

s.t. xt+i = '^elul+ ixt-'^uljst 

= Sf2;t + P;ut, t = 0,l,--- ,r-l, (4) 

where lot > is the trade-off parameter between the mean and the variance of the terminal 
wealth. 

The multi-period mean-variance model with intertemporal restrictions is to find the optimal 
control of the following problem, 

{MMV - IR) max ^ at [lt^{xt) - pt\w{xt)] , 
tela 

s.t. xt+i = StXt + P'tUt, t = 0, 1, • • • ,r - 1, 

where la = {ti, ■ ■ ■ , r^} with = T is the set of time instances on which the investor evaluates 
the performance of the portfolio, atit and atpt > are the time-t weights of the mean and 
the variance in the objective functional. In particular, if we choose la = {T}, ut^t = 1 and 
oltPt = wt > 0, {MMV — IR) reduces to the conventional multi-period mean- variance portfolio 
selection model {MMV) studied in Li and Ng [IFj. If la contains time instances other than 
T, {MMV — IR) is the multi-period portfolio selection problem with intertemporal restrictions 
considered in Costa and Nabholz [10]. Without loss of generality, we let la include all time 
instants from to T, while setting some at = it = Pt = for these time instances which do not 
need to be evaluated. 

The generalized mean-variance model for dynamic portfolio selection with risk control over 
bankruptcy is formulated as 

{MMV - B) max E(xt) - ojt^sx{xt), 

s.t. xj+i = sjxt + P[ut, t = 0, 1, • • • ,r - 1, 
P{xt<bt)<at, t = l,2,--- ,T-1, 

where ht is the disaster level and at is the acceptable maximum probability of bankruptcy set 
by the investor. By Tchebycheff inequality, problem {MMV — B) can be transformed into the 
following {GMV) model (see Zhu et al. [28j), 

{GMV) max ¥.{xt) - ujtYw{xt) , 

s.t. xt+i = stXt + 'P'tVit, i = 0, 1, • • • ,T - 1, 

Var(xt) < at [^{xt) - hf , t = 1, 2, • • • , T - 1. 

To solve {GMV), let us consider the Lagrangian maximization problem, 

T-l 

{L{u)) max E(xt) - UT^a.T{xT) - ^^^t |^Var(xt) - at (E(xt) - 6t)^ , 

t=i 

s.t. xt+i = stXf -I- P^uj, t = 0, 1, • • • ,r - 1, 
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where u = (a;i,a;2, ■ ■ ■ jUJt-i)' G ^ is the vector of Lagrangian multiphers. 

We are now building up the mean-field formulations for problems (MMV — IR) and {L{uj)), 
respectively. For t = 0, 1, • • • ,T — 1, the evolution of the expectation of the wealth dynamics 
specified in dH) can be presented as 

E(xt+i) = stE(xi)+E(P^)E(ut), 
E(xo) = xo, ^ ' 

due to the independence between and u^. Combining and ^ yields the following for 

t = o,i,--- ,r-i, 

' xt+i - E(xt+i) = st{xt - E(xt)) + Pjut - E(Pj)E(ut) 

= st {xt - E(xi)) + Pj(ui - E(ut)) + (Pi - E(Pj))E(ut), (6) 
xo - E(xo) = 0. 

What we are actually doing here is to enlarge the state space {xt) into (E(x(), — E(xt)) and 
the control space (u^) into (E(uf),ut — E(uf)). Although control vector E(ut) and — E(ut) 
can be decided independently at time t, they should be chosen such that 

E(ut-E(ut)) = 0, t = 0,l,--- ,T-1. 

We also confine admissible investment strategies (E(ut), — E(ut)) to be J^^-measurable Markov 
control. Then, {(E(xf),xt — E(xf))} is again an adapted Markovian process and Ft = o"(J"o V 
a(E(xt),xt-E(xt))). 

The problem [MMV — IR) can be reformulated as a mean-filed type of linear quadratic 
optimal stochastic control problem, 

T 

{MMV-MF) max at{4E(xt) - ptE[(xt - E(xO)'] }, 

t=i 

s.t. E(x() satisfies dynamic equation ([5]), 

xt+i — E(xt) satisfies dynamic equation ([6|), 
E(ut-E(ut)) = 0, t = 0,1,--- ,r-i. 

Similarly, problem (L(c<j)) can be reexpressed as 

(L - MF{oj)) max E(xr) - ujt"&[{xt - E(xt))^] 

r-i 

-Y,^t{^{xt-n^t)?] -at{n^t)-htf], 

t=l 

s.t. E(xt) satisfies dynamic equation (l5|), 

xt+i — E(xj) satisfies dynamic equation Q, 
E(ut-E(ut)) = 0, t = 0,l,--- ,T-1. 

In the above two formulations of a mean-field type, the corresponding problems become separable 
linear quadratic optimal stochastic control problems in the expanded state space (E(xf),xt — 
E(xt)) with the second control vector ut — E(uf) being constrained by a linear equation. 
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3 Optimal Policies for Multi-period Mean- Variance Portfolio 
Selection with and without Intertemporal Restrictions 



Lemma 1 (Sherman- Morrison formula) Suppose that A is an invertible square matrix and 
fi and v are two given vectors. If 



1 + u'A-^n 0, 



then the following holds, 



{A + nv') ^ = A , ^ . 



Lemma 2 Let Bt = E(P^)E-i(PtPj)E(Pt). Then 

[E(PtP;)-E(Pi)E(p;)]"'E(Pt) 



E-i(PtPOE(Pt) 



1-B, 



Proof. Applying Sherman-Morrison formula gives rise to the following, 

-It 



[E(PtP;) - E(POE(P;)] 'E(Pt 



, E-i(PtPl)E(Pi)E(Pl)E-i(P*Pt) 



E-i(PtP;) + 

E-"HP/PnE(P, 
l-Bt 



l-E(POE-i(PtP;)E(Pt) 
E-i (PtPDE(Pt)E(P9E-i (Pf PO 



l-Bf 



E(Pt) 
E(Pt) 



□ 



Consider the following separable multi-period control problem, 

TT-l 

^ ht{xt,vt) + hrixT) 



max E 



t=o 



s.t. xt+i = f{xt,vt), t = 0,l,--- ,T -1, 

where xt denotes the state, vt denotes the control, f{xt,Vt) represents the dynamics of the state 
and ht{xt,vt) is concave in Vf. Based on the principle of optimality in dynamic programming, 
the optimal control at time t is derived from the following recursion of dynamic programming, 

Vf = axgmajc{E[Jt+i{xt+i;vo,vi, ■ ■ ■ ,vt)\Tt] +ht{xt,vt)}, 

Vt 

where J't is the information set at time t, {vq, f i, • • • , vt) is the control sequence before time t + 1 
and 



Jt+i{xt+i;vo,vi,- ■ ■ ,vt) = max E 

Vt+l,--- ,fT-l 



T-l 

hj{xj,Vj) + hr{xT) 

3=t+l 



t+l 



is the benefit-to-go function at time t + \. 
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Lemma 3 Assume that 

E[Jt+i{xt+i;vo,vi,--- ,vt)\J't] = Gl{xt;vo,vi,--- ,vt) + G'f{xt;vo,vi,- ■ ■ ,vt), 
where E[Gf{xt; vq, vi, - ■ ■ , Vt)\To] = holds for any admissible {vq, vi, - ■ ■ , Vt). Then 
^^r = argmax [G]{xt]VQ,vi, ■ ■ ■ ,vt) + ht{xt,vt)] , 

Vt 

Jq{xo) = max \^G]{xt\VQ,vi,- ■ ■ ,Vt)\Fo] + ^W^hj{xj,Vj)\jFQ\ 

vo,— ,vt I ' 

i = o,i,--- ,r-i, 

i.e., G\{xt\ vq^vi, - ■ ■ , V*) + ht{xt, v^) can be regarded as the beneGt-to-go function at time t. 

Proof. Based on the principle of optimality of dynamic programming, the optimal control 
sequence on or before time t + 1 is determined by 

{vo,vl, ■■■ ,vl) = aTgmax\E[Jt+i{xt+i;vo,vi, ■ ■ ■ ,ut)|Jb] + u^)! Jb] ^ 

Thus, we have 

{vo,Vi,--- ,Vt) 

= argmax<^ E[E[Jt+i{xt+i;vo,vi, ■ ■ ■ ,vt)\Tt]\TQ] + y^E[hj{xj,Vj)\TQ] 
vo,--,vt I ^r-( 
^ J=0 

= argmax<^ E[Gl{xt;vQ,vi, ■ ■ ■ ,vt) + Gt{xt;vo,vi, ■ ■ ■ ,vt)\To] + y^E[hj{xj,Vj)\To] 
vo,— ,vt I 

^ J=0 

argmaxi E[Gl{xt;vo,vi, ■ ■ ■ ,Vt)\J='o] + y^E[hj{xj,Vj)\J='o] 

vo,— ,Vt I ' 

j=0 

= argmax] E[- ■ ■E[E[Gl{xt;vo,Vi, ■■■ ,vt) + ht{xt,vt)\J^t-i] + ht-i{xt-i,vt-i)\I't-2] ■ ■ ■ \J^o 
vo,---,vt L 

+ hoixo,vo)Y 

which implies 

= aTgmax{Gl{xt;vo,vi,--- ,vt) + ht{xt,vt)} . 

Vt 

Since E[Gf{xt; vo,vi, - ■ ■ , vt)|^o] = holds for any admissible {vo,vi, - ■ ■ ,vt), we have 
Jo(xo) = max \E[Jt+i{xt+i;vo,vi,- ■ ■ ,vt)\I'o] +y^E[hj{xj,Vj)\I'o] 

vo,— ,vt I ^—^ 

max \E[Gl{xt;vQ,vi,--- ,vt)\TQ] + y^E[hj{xj,Vj)\To]>- 

f?n . ■ ■ ■ .11+ I ' * I 

j=0 ^ 

□ 



^'o,--- ,vt 



Remark 1 Please note that if ht{xt,vt) = ht{xt), i.e., ht is independent of control vt, the 
conclusion of Lemma [3| can he expressed as follows, 



vl = argmax G\{xu vq.vi,--- , vt), 

Vt 



Jo(xo) = max \'&[G\{xt;vo,vi,- ■ ■ ,vt)\FQ] + 'S^Mhj{xj)\FQ] 

vo,-,vt [ ^ 

t = 0,l,--- ,T-1, 

i.e., Gl{xt;vo,vi, ■ ■ ■ ,v^) + ht{xt) can he regarded as the heneht-to-go function at time t. 

In this section, we reconsider the classical multi-period mean- variance model in Li and Ng |16j 
and the multi-period mean- variance model with intertemporal restrictions, {MMV — MF), in 
Costa and Nabholz [IQj under a mean-field formulation. Before presenting our main proposition, 
we define the following backwards recursions for pt and qt, 

pt = atpt + s'^{l - Bt)pt+i, ( qt = adt + stQt+i, 
Pt = dTPTi \ Qt = ot^t, 

for t = T — 1, T — 2, • • • ,1. We also set n0(") = 1 ^^'^ I^0(") ~ ^ ^'^^ convenience. 

Proposition 1 The optimal strategy of prohlem (MMV — MF) is given hy 

u* - E(u*) = -st{xt - E(xt))E-i(PtP;)IE(Pt), (7) 
= ^IE-HPtPOE(PO 
^ 2pt+i l-Bt ' ^ ' 



for t = 0,1, ■ ■ ■ ,T — 1, where the optimal expected wealth level is 

t-1 t-1 „ t-1 

Si. 



k=o j=o ^^^^ ^ e=j+i 

Proof. We first prove that, for information set Tt = cr{To V a{¥,{xt),xt — K{xt))), we have 
the following expression, 

T-1 2 

Jt{E{xt),Xt-E{xt)) = -pt{xt-E{xt)f + qtK{xt) + T^^i' (9) 

j=t 

as the benefit-to-go function at time t. 

When t = T, expression ([9]) is obvious. Assume that we have expression Q as the benefit-to- 
go function at time t + 1. We prove that expression ([9]) still holds for the benefit-to-go function 
at time t. For given information set J^t, i.e., {E{xt),xt — E{xt)), the recursive equation reads as 

Jt{E{xt),xt-Eixt)) 

= - atpt{xt - E{xt))'^ + atitE{xt) + max E[Jt+i{E{xt+i), xt+i - E{xt+i)) \Tt]. 

(E(ut),ut-E(ut)) 
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Based on dynamics ([5]) and we deduce 

E[Jt+iiEixt+i),xt+i-E{xt+i))\Tt] 

T-1 2 

=E[-pt+i{xt+i-E{xt+i)f + qt+iE{xt+i)\j't]+ ^ ^ 

j=t+i 



- Pt+iE[sl{xt -E{xt) f + (P't{ut -E{ut))) + (EK)(Pi-E(Pi) 
+ 2st{xt - Eixt))P't{ut - E{ut)) + 2st{xt - Eixt)) {P[ - E(P[))E(ut) 

+ 2(ut-E(ut))'Pt(P;-E(P0)E(ut) +qt+i[stEixt)+E(P'^)Eint)]+ ^ 



j=t+i 



Since both — E(u() and E(uj) are J-^-measurable and Pt is independent to J-t, we have 



P^Ui-E(ui)) 
E(u;)(Pt-E(PO 



(ut-E(ut))'E(PtP;)(ut-E(ut)), 



-Ft 



E(u;)(E(PtP;) -E(Pi)E(P;))E(ut) 



2st[xt - E(xt))P;(ut - E(ut))E(uO Ft = 2st[xt - E(xt))E(PO (u* - E(ut)) 



0, 



2(uf -E(ut))'(E(PtP;) -E(Pt)E(P;))E(ut), 



E 
E 
E 

E\2st{xt - E{xt)) (P; - E(P;))E(ut) 
E[2(ut - E(ut))'Pt(P; - E(P;))E(ut) 

which further imphes, 

E[J(+i(E(xt+i),Xi+i -E(xt+i))|j;] 
= - vt^x Yt - ^(^*))' + ("* - E(ut))'E(PtPO (ut - E(ut)) 

+ 2st(xt -E(xt))E(P^)(ut -E(ut))] -pt+iE(u;)(E(PtPO -E(Pt)E(P;))E(ut) 

T-1 2 

+ (ZmE(Pl)E(ut) + str7t+iE(xt) + J]] -^B, 



4Pi+i ' 



+ 2(ut - E(ut))'(E(PtPO - E(Pt)E(PO)E(ut) 
--G\ (E(xt) , xt - E(xt ) ; E(ut ) , ut - E(ut) ) + (E(xt) , - E(xt) ; E(ut ) , - E(ut)) , 



where 



GKE(xt),xt - E(xt);E(ut),Ut -E(ut)) 
= - pt+i _ E(xt))' + (ut - E(ut))'E(PtPO (ut - E(ut)) 

+ 2st(xt - E(xt))E(P;)(ut - E(ut))] -pt+iE(u;)(E(PtPO - E(Pt)E(PO)E(ut) 

T-1 2 

+ (?t+iE(P;)E(ut) + stgt+iE(xt) + V -^B, 
Gf(E(xt),xt -E(xt);E(ut),ut -E(ut)) = 2(ut - E(ut))'(E(PtP^) - E(Pt)E(P;))E(ui). 
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Note that any admissible (E(ut), uj — E(ut)) satisfies E(ut — E(uf)) = 0, which imphes 

E[G? (E(xt), xt - E(xt); E(ut), - E(ut)) | Jq] 
=2E[(ut - E(uO)'(E(PtPO - E(Pt)E(P;))E(ut)|7-o] = 0. 

Using Lemma E] and Remark [H we get 

(E(u*),u* -E(u*)) = argmaxGti(E(xt),xt -E(xt);E(uO,Ut -E(ut)). 

By means of Lemma [H we deduce 

G\ (E(xt), - E(xt); E(ut), - E(ut)) 
= - pt+i{s?(l - Bt){xt - E(xt))' + [(ut - E(ut)) + st(xt - E(xt))E-i(PtP;)E(P;) 

• E(PtP;) [(uj - E(ut)) + st{xt - E(xt))E-i(PtPj)E(P;)] } 



■Pt+i 



l-Bt 



(E(PiP;)-E(Pi)E(P;)) 



E(ui^ V f V t; 



2p, 



't+i 



l-Bt 



+ ^l±^Bt + Stqt+iE{xt)+ V 



T-l „2 



i=t+l 



5.- 



Thus, 



u* - E(u*) = -st{xt - E(xt))E-i(PtPOE(P 



t), 



qt+i E-i(P,POE(Pt 
2pt+i 1 - 5t 



where the hnear constraint E(uJ' — E(uJ')) = automatically holds. Therefore, based on Remark 
[H we have 



Gl (E(xt), Xt - E(xt); EK), u* - E(u*)) - atPt{xt - E{xt))^ + aAE(xt) 

T-l 2 

= -pt{xt - E(xt))' + qtEixt) + V T^^i 



as the benefit-to- go function at time t. 

Substituting the optimal expected portfolio strategy dS]) into dynamics we further deduce 
the following recursive relationship of the optimal expected wealth level, 

qt+i Bt 



E(xt+i) = stE{xt) + 



2pt+i l-Bt' 



which implies 



t-i 



t-i 



K{xt) = xqJ] sk+y^^ 

k=o j=o 



n 



2Pi+i ^-Bj ^^^^^ 



□ 
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The optimal strategy obtained in Proposition [T] covers the exiting results in the literature as 
its special cases. 

Case 1: Let la = {T}, ar^T = 1, aTPT = ujt > 0- Then, we have 



T-l 



T-l 



Pt = UJT 



'J' 



which further implies 



t-l T-l t-l T-l 

A;=0 k=t j=0 t=j 



A;=0 
T-l 



k=t 



1 i-nr=o(i--gfc) 



\[IZI{1-B,) 

Therefore, we have 



■ T-l 



k=0 



T-l 

n 

J fc=t+i 

(10) 



which is the optimal portfolio strategy obtained in Li and Ng [16]. 
Substituting ([8]) and (flOl) to dynamics ([6]) yields 

E{xt+i - E{xt+i)f = s2(l - Bt)E{xt - E{xt)Y + ^ 



B, 



nl=Li4i^-Bk) 44nr=/(i-i?.)' 



which further implies 

E{xT-E{xT)y=Y^ n 4(1 -^fc 



T-l T-l 



1 



B, 



j=0 k=j+l 

T-l T-l 



n 



T-l a 



k=j+l •'fc 

T j=0 k=j 
,2 TlT-li 



(l-Bk) 4a;2nL/(l-B,) 



4a;- 



Thus, the efficient frontier is given by 

tT-1 



Var(xr) = E{xt - E(xr))' = ^'!=^[\~ ^ U{xt) Sk) for E(xt) > xq J] 

l-llfc=o(l-^fe) V fc=o ^ ifc=0 



which is the same as the efficient frontier established in Li and Ng |16] . 
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Case 2: Let Iq, = {ri, • • • , Tq,} with = T. Then we have the optimal portfoho strategy as 
fohows, 

u* = -stXtE-^PtP'tM^t) + stE{xt)E~\PtP[)E{Pt) + ^ HFtP'tM'Pt) ^ ^^^^ 

2pt+i I- Bt 

where 

pt = atpt + s1{l - Bt)pt+i, t = Ti, ( qt = atit + stqt+i, t = Ti, 
pt = Sf (1 - Bt)pt+i, Ti-i <t <Ti, < qt = stqt+i, U-i < t < n, 
PT = axPT, [ qr = ar^T, 

and 

which is the same as the result developed in Costa and Nabholz [lOj. Note that Costa and 
Nabholz originally studied a market consisting of all risky assets in their investigation. When 
we introduce a riskless asset into the market, parameters of Qi, Si, Ai and Vi defined in (22), 
(23), (28) and (29), respectively, in Costa and Nabholz |10j have been modified to 

r o <: /I TT -n ^~llfc=r, - Bk) qr,+^ 
g^ = -2p^^, S, = -qr„ Ai= [[ Sk, V, = —-—^ • 



4 Generalized Mean- Variance Strategy with Risk Control Over 
Bankruptcy 

In this section, we reconsider the generalized mean- variance model with risk control over bankruptcy 
in Zhu et al. [28] under the mean-field framework, i.e., we consider problem (L — MF{bj)) first. 
For t = T — 1, T — 2, • • • , 1, we define pt, rjt and as follows, 

pt = u:t + s1{l - Bt)pt+i, ( r]t = ^tat + sfCt+iVt+i, / ^* = -^tath + stCt+i6+i, 

PT = UJT, 1 ?/T = 0, \ ^r^ = l^ 

where Lagrangian multiplier > and 

_ Pt+i{l-Bt) + 2rit+iBt 
Pt+^(l-Bt)+r^t+^Bt - ' 

due to 1 > -Bt > 0. Then, it is obvious that > and rjt > 0. 

Lemma 4 Suppose that pt+i > and 'qt+i > hold. Then 

[pl+iE(P,PO - (pt+i +r?t+i)E(Pt)E(PO]-^E(P,) = - ^n^T?^^'^ - 

Pt+i{l - Bt) + r]t+iBt 
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Proof. Applying Sherman-Morrison formula (Lemma [T]) yields 
[pt+iECPfF't) - (pt+i +r?i+i)E(POE(P;)]-'E(Pt) 



E-\PtP'tmPt) 
pt+i{l - Bt) + r]t+iBt' 



i-PtliiPt+i + m+i)n^'t)^-\^tP't)E{Pt) 

p^l,E-~\PtP[){pt+^ + 77i+i)E(POE(POpr+ilE-i(PtPD 



EiPt 



EiP, 



Proposition 2 The optimal strategy of problem (L — MF{uj)) is given by 



u* - E(u*) = -st{xt - E(:Et))E-i(PtP;)E(P 



t), 



E(u* 



^t+i + r]t+iStE{xt) 



E-^(PtP;)E(Pt), 



*^ Pt+i{l - Bt) + rit+iBt 
where the optimal expected wealth level E{xt) evolves according to 

ij+iBj 



□ 

(12) 
(13) 



t-i 



t-i 



E{xt) = xq\[ Ck+iSk + ^ 



Pj+iil - Bj)+r]j+iBj 



k=0 j=O^J^^'- ■" ' 'J^^ l=j+l 

Moreover, the optimal objective function of {L — MF(u)) is 



(14) 



T-l 



H{uj) = r/iCiSo^o + 2^iCiSoXo + ^ 

j=0 



Pj+i{l - Bj) + rjj+iB^ 



(15) 



with u}Q = aQ = bo = 0. 



Proof. We first prove that for information set Tt = (t{Fo V a{E{xt),xt — E{xt)))-, we have the 
following expression, 



Jt{E{xt),xt-E{xt)) 
-- - pt{xt - E{xt)f + Vt{E{xt)f + 2CtE{xt) + ^ 



T-l 



Pj+i{l - Bj)+r]j+iBj 



, (16) 



as the benefit-to-go function at time t. 



When t = T, expression (I16p is obvious. Assume that expression (|16p holds at time t + 1 as 
the benefit-to-go function. We show that expression (jl6p still holds for the benefit-to-go function 
at time t. For given information set Tt, i-e., {E{xt),xt —E{xt)), applying the recursive equation 
yields 

Jt{E{xt),xt - E{xt)) = - ujt{xt - E(xt))^ + ujtat{E{xt)f - 2ujtatbtE{xt) + uJtatbf 

+ max E[Jt+i{E{xt+i),xt+i -E{xt+i))\Tt] . 
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Based on the dynamics in ([5]) and ([6]), we have 

E [Jt+i(E(xt+i),xt+i -E(xt+i))|j-t] 
=E [-pt+i{xt+i-Eixt+i))^ + r]t+i{Eixt+i)f + 2^t+iEixt+i)\Tt 

T-l 

sr~^ , ,2 



--Bj) + f?i+l5J• 
-p^+lE[s2(a;^ _ E{xt)f + (P't{ut - E{ut))y + (EK)(Pt - E{Pt) 
+ 2st{xt - E(xt))P;(ut - E(ut)) + 2st{xt - E(xt)) (P^ - E(P;))E(ut) 
+ 2{ut- E(ut))'Pt(P; - E{P[))E{ut)\Tt] + m+i Utn^t) + E{V't)E{vit) 



+ 2e 



t+i 



T-l 

stE(xt)+E(POE(ut)] + 

j=t+i 



— — — + WjOj-ft • 

Pj+i{l- Bj) + r}j+iBj ^ 



Similar to the proof of Proposition [H we have 

E [Jt+i{E{xt+i),xt+i-E{xt+i))\Tt] 
= - Pt+i [s? {xt-E{xt)f+ {ut - E(ut)) 'E(PtPO {ut -E{ut)) 

+ 2st{xt - E{xt))E(P[){ut - E{ut))] - E(uO [pt+iE(PtP;) - {pt+i + r]t+i)E(Pt)E(P[)^ E(ut) 
+ (2^+1 + 2r]t+iStEixt))E{P[)Eiut) + r?t+is2(E(xt))2 + 2^t+iStEixt) 

T-l 

+ E 

j=t+i 



+ 2(ut - E(ut))'(E(PtPO - E(Pt)E(Pj))E(ut) 



=GliE{xt), xt - E{xt);E{ut),ut - E{ut)) + GliE{xt),xt - E(xt); E(ut), - E(ut)), 
where 

Gl{E{xt),xt -E{xt);E{^t),ut -E{ut)) 
= - pt+i [s^t{xt - E(xt))' + {ut - E(ut))'E(PtP;) {ut - E{ut)) 

+ 2st{xt-E{xt))E(P't){ut-E{ut))j - E(u;) [pt+iE(PtP;) - {pt+i + r]t+i)E(Pt)E(P't)\E{ut) 
+ i2^t+i + 2r]t+iStEixt))E{P',)Eiut) + r]t+isUE{xt)f + 2Ct+iStE{xt) 



T-l 

E 

j=t+i 



— — — + ujjajb] 



GUHxt),xt - E{xt);E{^t), ut - E(ut)) = 2{ut- E(ut))'(E(PtPO - E(POE(PO)E(ui). 
Note that any admissible (E{ut), ut — E{ut)) satisfies E{ut — E(uf)) = 0, which implies 

E[G^E{xt),xt - E{xt);E{ut),ut -Eiut))\To] = 0. 
Using Lemma [3] and corresponding to Remark [H we get 

(E(u*), u* - E(u*)) = argmaxG^ {E{xt),xt - E(xt); E(ut), - E{ut)) . 
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By means of Lemma [2l we deduce 

GjiEixt),xt - E{xt);E{ut),ut - E{ut)) 
= - Pt+iE{sUl - Bt){xt - E{xt)f + [{ut - E{ut)) + st{xt - E{xt))E-\PtPt)'E(Pt)] 'n^tP't 
{ut - E{ut)) + st{xt - E{xt))E-\PtP't)E{P[)] } 
Ct+i + Vt+iStE{xt) 



E{ut) 
E{ut) - 



- Bt) + r]t+iBt 
Ct+i + Vt+iStE{xt) 1 



E-\PtP't)E(Pt) 



Pt+i{l - Bt) + rjt+iBi 



E-\Pt'P't)E{Pt) 



+ 



Pt+iE{PtP't) - [pt+i + r,t+i)E{Pt)E{P't) 
{it+i + r]t+iStE{xt)f 



pt+i(l - Bt) + rjt+iB^ 



■Bt 



T-l 



+ Tit+is^tiHxt))^ + 26+iSiE(xt) + 

j=t+l 



Pj+ii^ - Bj) + Vj+iBj 



ujjajbj 



Thus, 



u* - E(u*) = -st{xt - E{xt))E-\PtP't)E{Pt), 



Ci+i + r]t+iStE{xt) 



*^ pt+i{l - Bt) + 7]t+iBt 
which satisfies the Unear constraint E{ut — E{ut)) = 0. 



E-i(PiPOE(Pt), 



Based on Remark [H we can find 

G'J(E(xt),xt -E(xt);E(u*),u* -EK)) 
- ujt{xt - E(xt))^ + uJtat(E{xt))'^ - 2uJtatbtE{xt) + utatbt 

T-l 

= - pt{xt - E{xt))^ + Vt{E{xt))^ + 26E(xt) + 



Pj - Bj) + rij+iBj 



+ ujjajb] 



as the benefit-to-go function at time t. 

Substituting the optimal expected portfoHo strategy (fTSll into dynamics 

£,t+iBt 



gives rise to 



E{xt+i) = Ct+iStE{xt) + 



Pt+i{l - Bt) + rjt+iBt' 



which imphes 



t-l 



t-l 



E{xt) = xo'[\_ Ck+iSk + Y 



ij+iBj 



k=0 



j=0 



Pj+i{l - Bj) +r]j+iB. 



t-l 

n c 

^ e=j+i 



e+ise- 



Noting that lhq = = bg = 0, then the expression of the optimal objective function of 
(L - MF{uj)) is obvious. □ 



Substituting (fT2]) and (fTSl) to dynamics ([6]) yields 



E{xt+i - E{xt+i)f = s}{l - Bt)E{xt - E{xt)f + 



(6+1 + r]t+iStE{xt))'^ 
{pt+i{l - Bt) + rit+iBt 



:{Bt-B, 
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which leads to the foUowing expression of the variance of the optimal wealth level, 



\av{xt) = E{xt-E{xt)y = Y, 



j=0 



t-i 



e=j+i 



Zhu et al. [28j analyzed the Lagrangian problem (L(w)) via the embedding scheme. They, 
however, do not succeed to obtain an analytical form of the optimal objective value function 
H{(jj). Thus, they proposed the prime-dual algorithm to solve the following dual problem of 
{GMV) numerically, 

min H{oj). 

In this paper. Proposition [2] does not only derive an analytical policy but also successfully 
reveal the explicit form of H(uj). Thus, a simple steepest descent algorithm can be directly 
applied to derive the optimal Lagrangian multiplier vector to*, due to the convexity of H(uj) 
(see [28|). Then the optimal strategy of (GMV) can be presented by the portfolio strategy in 
Proposition [2] with uj = uj*. Therefore, our new mean- field formulation clearly, yet powerfully, 
offers a more efficient and more accurate policy scheme again in this situation, when compared 
to the existing literature. 

Example 1 Consider an example of constructing a pension fund consisting of S&P 500 (SP), 
the index of Emerging Market (EM), Small Stock (MS) of U.S market and a bank account. 
Based on the data provided in Elton et al. fl3^ . the expected values, variances and correlations 
of the annual return rates of these three indices are given in TableUl 



Table 1: Data for the asset allocation example 





SP 


EM 


MS 


Expected Return 14% 


16% 


17% 


Variance 


18.5% 


30% 


24% 


Correlation 


SP 


1 


0.64 


0.79 


EM 




1 


0.75 


MS 






1 



We further assume that the annual risk free rate is 5% (st = 1.05 ) and consider a five-period 
generalized mean-variance model with risk control over bankruptcy, i.e., a (GMV) problem. 
Then, E(P(), Cov(P() and E(P(Pj) can be computed as follows, for t = 0,1, ■ ■ ■ ,4, 



E(Pt 



0.09 
0.11 
0.12 



Cov(Pi) 



0.0342 0.0355 0.0351 
0.0355 0.0900 0.0540 
0.0351 0.0540 0.0576 



E(PtPt] 



0.0423 0.0454 0.0459' 
0.0454 0.1021 0.0672 
0.0459 0.0672 0.0720 



Assume that an investor has initial wealth xq = 1 and trade-off parameter ws = 1. The 
disaster level and the acceptable maximum probability of bankruptcy are chosen as bt = and 
at = 0.10, respectively, for t = 1,2, 3, 4. 
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To solve the dual problem of (GMV) and get the optimal Lagrangian multiplier vector u* , 
we consider the following unconstrained problem, 



min H(uj) 



1=1 



where X]i=i l^sl'^*) barrier function used to ensure uj € M^, /i is the barrier parameter 

and H{(jj) satisfies <f75)) . Theoretically speaking, by setting // | 0, we can derive the optimal 
Lagrangian multiplier vector. Using the steepest descent algorithm, we get 



OJ 



[0.0014, 0.2658, 0.2543, 0.0014]'. 



We further have 



'Pl 




"0.9854" 




'vi 




"0.0615" 




"6" 




"0.6200" 




"Ci" 




"1.0168" 


P2 




1.1364 




m 




0.0550 








0.5828 








1.0130 






1.0054 


1 


V3 




0.0256 




^3 




0.5513 




Ca 




1.0069 


Pa 




0.8673 




V4 




0.0001 








0.5250 




C4 




1.0000 


^5. 




1 




Jl5. 









es 




0.5 




C5 




1 



The optimal expected wealth levels are then given by 

E{xo) = 1, E(xi) = 1.2366, E(2;2) = 1.4537, E(x3) = 1.6856, E(x4) = 1.9353, E(x5) = 2.1687. 
Therefore, according to Proposition [2 the optimal strategy of (GMV) is specified as follows. 



* 

Uo = 


(-1.05x0 + 1.9197)K, 


* 

Ul = 


(-1.05x1 +2.0218)K, 


* 

U2 = 


(-1.05x2 + 2.2688)K, 


* 

U3 = 


(-1.05x3 + 2.5409)K, 


* 

U4 = 


(-1.05x4 + 2.6687)K, 



where 



K = E-i(PtP;)E(PO 



1.0580 
-0.1207 
1.1052 



Finally, the variances of the optimal wealth levels are given as 
Var(xi) = 0.1275, Var(x2) = 0.1986, Var(x3) = 0.2648, Var(x4) = 0.3295, Var(x5) = 0.3536. 



We can further get the efRcient frontier of (GMV) by adjusting the trade-off parameter cjs 
from to +00, which is represented by the dash dot line in FigureU^ In the figure, the solid 
curve above is the efEcient frontier of the classical five-period mean-variance model, which is 
plotted for a comparison purpose. 
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15 




Variance 

Figure 1: Efficient frontiers of (MV) and {GMV) 

5 Conclusions 

The nonseparable multi-period mean-variance and related problems have been solved in the 
literature via embedding scheme, Lagrangian formulation or mean-variance hedging problem. 
However, we may not be able to derive optimal value functions of these transformed problems 
analytically, especially, when some constraints are attached to the problem setting. Hence, we 
often need to invoke some numerical algorithms to compute the corresponding best auxiliary pa- 
rameter or Lagrangian parameter. In this paper, we adopt the mean-filed formulation, as a more 
efficient means, to directly tackle the nonseparable multi-period mean- variance portfolio selec- 
tion model, multi-period mean-variance model with intertemporal restrictions, and generalized 
mean- variance model with risk control over bankruptcy. Under this newly proposed framework 
of mean-field formulations, we are capable of deriving analytical solutions for all these problems, 
thus improving the solution quality and facilitating the solution process. 
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